Using Syntactic Dependency-Pairs Conflation to Improve Retrieval Performance in Spanish
نویسندگان
چکیده
This article presents two new approaches for term indexing which are particularly appropriate for languages with a rich lexis and morphology, such as Spanish, and need few resources to be applied. At word level, productive derivational morphology is used to conflate semantically related words. At sentence level, an approximate grammar is used to conflate syntactic and morphosyntactic variants of a given multi-word term into a common base form. Experimental results show remarkable improvements with regard to classical indexing methods.
منابع مشابه
Towards the Development of Heuristics for Automatic Query Expansion
In this paper we study the performance of linguisticallymotivated conflation techniques for Information Retrieval in Spanish. In particular, we have studied the application of productive derivational morphology for single word term conflation and the extraction of syntactic dependency pairs for multi-word term conflation. These techniques have been tested on several search engines implementing ...
متن کاملUsing syntactic dependency - pairs con ationto improve retrieval performance in Spanish ?
This article presents two new approaches for term indexing which are particularly appropriate for languages with a rich lexis and morphology, such as Spanish, and need few resources to be applied. At word level, productive derivational morphology is used to connate semantically related words. At sentence level, an approximate grammar is used to connate syntactic and morphosyntactic variants of ...
متن کاملOn the Usefulness of Extracting Syntactic Dependencies for Text Indexing
In recent years, there has been a considerable amount of interest in using Natural Language Processing in Information Retrieval research, with specific implementations varying from the word-level morphological analysis to syntactic parsing to conceptual-level semantic analysis. In particular, different degrees of phrase-level syntactic information have been incorporated in information retrieval...
متن کاملTowards the development of heuristics
In this paper we study the performance of linguistically-motivated connation techniques for Information Retrieval in Spanish. In particular, we have studied the application of productive derivational morphology for single word term connation and the extraction of syntactic dependency pairs for multi-word term connation. These techniques have been tested on several search engines implementing di...
متن کاملAn evaluation of conflation accuracy using finite-state transducers
Purpose – To evaluate the accuracy of conflation methods based on finite-state transducers (FSTs). Design/methodology/approach – Incorrectly lemmatized and stemmed forms may lead to the retrieval of inappropriate documents. Experimental studies to date have focused on retrieval performance, but very few on conflation performance. The process of normalization we used involved a linguistic toolbo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002